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54-000510US . ST2 5 . txt 
SEQUENCE LISTING 

<110> The scripps Research institute 
Anderson, 3 c 
Schultz, Peter G 
Santoro, Stephen 

<120> COMPOSITIONS OF ORTHOGONAL GLUTAMYL-tRNA AND AMINOACYL-tRNA 
SYNTHETASE PAIRS AND USES THEREOF 

<130> 54-000510PC 

<140> PCT/US 04/021813 
<141> 2004-07-07 

<160> 111 

<170> Patentin version 3.2 

<210> 1 
<211> 78 
<212> RNA 

<213> Aeropyrum pernix 
<400> 1 

gccgcgguag uauagcccgg cccaguaugc gggccucucg agcccgugac ccggguucaa 60 
aucccggccg cggcacca 78 



<210> 2 
<211> 78 
<212> RNA 

<213> Aeropyrum pernix 
<400> 2 

gccgcgguag uauagcccgg cccaguaugc gggccuuucg agcccgugac ccggguucaa 60 
aucccggccg cggcacca 78 



<210> 3 
<211> 78 
<212> RNA 

<213> Archaeoglobus fulgidus 

<400> 3 

gcuccggugg uguagcccgg ccaaucauac gggacucucg aucccgugac ccggguucaa 60 
aucccggccg gagcacca 78 



<210> 4 
<211> 78 
<212> RNA 

<213> Archaeoglobus fulgidus 

<400> 4 

gcuccggugg uguagcccgg ccaaucauuc cggccuuucg agccggcgac ccggguucaa 60 
aucccggccg gagcacca 78 



<210> 5 

<211> 78 

<212> RNA 

<213> Halobacterium sp. 
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<400> 5 

gcuccguugg uguaguccgg ccaaucaucu ugcccucuca cggcaaggac uaggguucaa 60 
aucccugacg gagcacca 78 

<210> 6 
<211> 78 
<212> RNA 

<213> Halobacterium sp. 
<400> 6 

gcucgguugg uguaguccgg ccaaucaugu uggccuuucg agccgacgac caggguucaa 60 
aucccugacc gagcacca 78 

<210> 7 
<211> 78 
<212> RNA 

<213> Methanobacterium thermautotrophicus 
<400> 7 

gcuccgguag uguaguccgg ccaaucauuu cggccuuucg agccgaagac ucggguucaa 60 
aucccggccg gagcacca 78 

<210> 8 
<211> 78 
<212> RNA 

<213> Methanococcus jannaschii 
<400> 8 

gcuccggugg uguaguccgg ccaaucaugc gggccuuucg agcccgcgac ccggguucaa 60 
aucccggccg gagcacca 78 

<210> 9 
<211> 78 
<212> RNA 

<213> Methanococcus mar1pa1udis 

<400> 9 

gcuccagugg uguaguccgg ccaaucaucc ggcccuuucg aggccgggac ucggguucaa 60 
aucccggcug gagcacca 78 

<210> 10 
<211> 78 
<212> RNA 

<213> Methanosarcina mazei 
<400> 10 

gcuccgauag uguagcucgg ccaaucauuc aggacucuca cuccugcgac ugggguucaa 60 
auccccaucg gagcacca 78 

<210> 11 
<211> 78 
<212> RNA 

<213> Methanosarcina mazei 
<400> 11 

gcuccgguag uguaguccgg ccaaucauuc cggccuuucg agccgaagac ucggguucga 60 
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78 



<210> 12 
<211> 78 
<212> RNA 

<213> Pyrobaculum aerophilum 
<400> 12 

cagccgguag ucuagcccgg acaaggaugc gggccucucg agcccgugac ccggguucaa 60 
aucccggccg gggcacca 78 

<210> 13 
<211> 78 
<212> RNA 

<213> Pyrococcus abyssi 
<400> 13 

gccccggugg uguagcccgg ucaaucaugc gggacucucg aucccgcgac ccggguucaa 60 
aucccggccg gggcacca 78 

<210> 14 
<211> 78 
<212> RNA 

<213> Pyrococcus abyssi 
<400> 14 

gccccggugg uguagcccgg ucaaacaugc gggccuuucg agcccgcgcc ccggguucaa 60 
aucccggccg gggcacca 78 

<210> 15 
<211> 78 
<212> RNA 

<213> Pyrococcus furiosus 
<400> 15 

gccccggugg uguagcccgg ucaaucaugc gggacucucg aucccgcgac ccggguucaa 60 
aucccggccg gggcacca 78 

<210> 16 
<211> 78 
<212> RNA 

<213> pyrococcus furiosus 
<400> 16 

gccccggugg uguagcccgg ucaaacaugc gggccuuucg agcccgcgcc ccggguucaa 60 



aucccggccg gggcacca 



<210> 17 

<211> 78 

<212> RNA 

<213> Pyrococcus horikoshii 



78 



<400> 17 

gccccggugg uguagcccgg ucaaucaugc gggacucucg aucccgcgac ccggguucaa 60 



aucccggccg gggcacca 
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<210> 18 
<211> 78 
<212> RNA 

<213> Pyrococcus horikoshii 
<400> 18 

gccccggugg uguagcccgg ccaaacaugc gggccuuucg agcccgcgcc ccggguucaa 60 
aucccggccg gggcacca 78 

<210> 19 
<211> 77 
<212> RNA 

<213> Sulf Globus sol f atari cus 
<400> 19 

gccgcggugg uuuagcccgg ucagaaugcg ggccucucga gcccgugacc cggguucaaa 60 
ucccggccgc ggcacca 77 

<210> 20 
<211> 77 
<212> RNA 

<213> Sulfolobus sol f atari cus 
<400> 20 

gccgcggugg uuuagcccgg ucagaaugcg ggccuuucga gcccgugacc cggguucaaa 60 
ucccggccgc ggcacca 77 

<210> 21 
<211> 77 
<212> RNA 

<213> Sulfolobus tokodaii 
<400> 21 

gccgcgguag uauagcccgg ucaguaugcg ggccucucaa gcccguggcc cggguucaaa 60 
ucccggccgc ggcacca 77 

<210> 22 
<211> 78 
<212> RNA 

<213> Sulfolobus tokodaii 
<400> 22 

gccgcgguag uauagcccgg ucaagcacgc gggccuuucg agcccgugac ccggguucaa 60 
aucccggccg cggcacca 78 

<210> 23 
<211> 78 
<212> RNA 

<213> Thermoplasma acidophil urn 
<400> 23 

gcuccggugg uguaguccgg ccaagcauua gggacucuca aucccccgac ucggguccaa 60 
aucccgaccg gagcacca 78 
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<210> 24 
<211> 78 
<212> RNA 

<213> Thermoplasma acidophilum 
<400> 24 

gcuccggugg uguaguccgg acaagcauau cggccuuucg agccgacgac cuggguccaa 60 
aucccggccg gagcacca 78 

<210> 25 
<211> 78 
<212> RNA 

<213> Thermoplasma vol cam" urn 
<400> 25 

gcuccggugg uguaguccgg ccaagcauua gggacucuca aucccccgac ucggguccaa 60 
aucccgaccg gagcacca 78 

<210> 26 
<211> 78 
<212> RNA 

<213> Thermoplasma vol cam" urn 

<400> 26 

gcuccggugg uguaguccgg acaagcauau cggccuuucg agccgacgac cuggguccaa 60 
aucccggccg gagcacca 78 

<210> 27 

<211> 78 

<212> RNA 

<213> artificial 

<220> 

<223> consensus tRNA 
<400> 27 

gcyccggugg uguagcccgg ccaakcaugc gggccuyucg agcccgcgac ccggguucaa 60 
aucccggccg grgcacca 78 

<210> 28 

<211> 78 

<212> RNA 

<213> artificial 

<220> 

<223> mutant tRNA 
<400> 28 

gccccggugg uguagcccgg ccaagcaugc gggccucuaa agcccgcgac ccggguucaa 60 



aucccggccg gggcacca 

<210> 29 
<211> 76 
<212> RNA 

<213> Aeropyrum pernix 
<400> 29 

agccgggucg ucuagcggcc caggaugcgg ggcucuggcc cccgugaccg ggguucgaau 
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76 



<210> 30 
<211> 76 
<212> RNA 

<213> Aeropyrum pernix 
<400> 30 

agccgggucg ucuagcggcc caggaugcgg gguuuuggcc cccgugaccc ggguucgaau 60 
cccggcccgg cuacca 76 

<210> 31 
<211> 76 
<212> RNA 

<213> Archaeoglobus fulgidus 
<400> 31 

agccccgugg gguagcgguc aauccugccg gacucuggau ccggcgacgc cgguucgaau 60 
ccggccgggg cuacca 76 

<210> 32 
<211> 76 
<212> RNA 

<213> Archaeoglobus fulgidus 
<400> 32 

agccccgugg uguagcgguc aaucaugcgg gccuuuggag cccgcgaccg ggguucgaau 60 
ccccgcgggg cuacca 76 

<210> 33 
<211> 76 
<212> RNA 

<213> Halobacterium sp. 

<400> 33 

agucccguag gguaguggcc aauccugaag ccuucugggg gcuucgacgg aaguucgaau 60 
cuucccggga cuacca 76 

<210> 34 
<211> 76 
<212> RNA 

<213> Halobacterium sp. 
<400> 34 

agucccgugg uguaguggcc aaucauaugg gccuuuggag cccacgacgg cgguucgaau 60 
ccgcccggga cuacca 76 

<210> 35 
<211> 75 
<212> RNA 

<213> Methanobacterium thermautotrophicus 
<400> 35 

agucccgugg gguaauggca auccugaugg acucuggauc caucgauagc gguucgacuc 60 

cgcucgggac uacca 75 
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<210> 36 
<211> 75 
<212> RNA 

<213> Methanobacterium thermautotrophicus 
<400> 36 

agucccgugg gguaguggua auccugcugg gcuuuggacc cggcgacagc gguucgacuc 60 
cgcucgggac uacca 75 

<210> 37 
<211> 76 
<212> RNA 

<213> Methanococcus jannaschii 
<400> 37 

agcccggugg uguaguggcc uaucauccgg ggcuuuggac cccgggaccg cgguucgaau 60 
ccgcgccggg cuacca 76 

<210> 38 
<211> 76 
<212> RNA 

<213> Methanococcus maripaludis 
<400> 38 

agcccaguag uguaguggcc aaucauccgu gccuuuggag caugggaccg cgguucgaau 60 
ccgcgcuggg cuacca 76 

<210> 39 
<211> 76 
<212> RNA 

<213> Methanopyrus kandleri 

<400> 39 

agggccgugg gguagcgguc uauccugcgg ggcuuuggac cccgcgaccc cgguucaaau 60 
ccgggcggcc cuacca 76 

<210> 40 
<211> 76 
<212> RNA 

<213> Methanosarcina mazei 
<400> 40 

agcccgguag uguagugguc aaucaugcgg gacucuggau ccugcaaccu cgguucgaau 60 
ccgugccggg cuacca 76 

<210> 41 
<211> 76 
<212> RNA 

<213> Methanosarcina mazei 
<400> 41 

aguccuguag gguagugguc aauccuucgg gccuuuggag cccgggacag cgguucgaau 60 
ccgcucagga cuacca 76 
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<210> 42 
<211> 75 
<212> RNA 

<213> Pyrobaculum aerophilum 
<400> 42 

agcccggucg ucuagcggcc aggaugcggg gcucuggacc ccguggcccg gguucgaauc 60 
ccggccgggc uacca 75 

<210> 43 
<211> 76 
<212> RNA 

<213> Pyrobaculum aerophilum 
<400> 43 

agcccggucg ucuagcggcc aaggaugcgg ggcuuuggac cccguggccc ggguucgaau 60 
cccggccggg cuacca 76 

<210> 44 
<211> 76 
<212> RNA 

<213> Pyrococcus abyssi 
<400> 44 

agccccgugg uguagcggcc aagcaugcgg gacucuggau cccgcgaccg ggguucgaau 60 
ccccgcgggg cuacca 76 

<210> 45 
<211> 76 
<212> RNA 

<213> pyrococcus abyssi 
<400> 45 

agccccgugg uguagcggcc aagcaugcgg gacuuuggau cccgcgaccc ggguucgaau 60 
cccggcgggg cuacca - 76 

<210> 46 
<211> 76 
<212> RNA 

<213> Pyrococcus furiosus 
<400> 46 

agccccgugg uguagcggcc aagcaugcgg gacucuggau cccgcgaccg ggguucgaau 60 



ccccgcgggg cuacca 



<210> 47 

<211> 76 

<212> RNA 

<213> Pyrococcus furiosus 



76 



<400> 47 

agccccgugg uguagcggcc aagcaugcgg gacuuuggau cccgcgaccc ggguucgaau 60 
cccggcgggg cuacca 76 



<210> 48 
<211> 76 
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<212> RNA 

<213> Pyrococcus horikoshii 
<400> 48 

agccccgugg uguagcggcc aagcaugcgg gacucuggau cccgcgaccg ggguucgaau 60 
ccccgcgggg cuacca 76 

<210> 49 
<211> 76 
<212> RNA 

<213> pyrococcus horikoshii 
<400> 49 

agccccgugg uguagcggcc aagcaugcgg gacuuuggau cccgcgaccc ggguucgaau 60 
cccggcgggg cuacca 76 

<210> 50 
<211> 76 
<212> RNA 

<213> Sulfolobus sol f atari cus 
<400> 50 

agccggguag ucuagugguc aaggauccag ggcucuggcc ccugggacca ggguucgaau 60 
cccugcccgg cuacca 76 

<210> 51 
<211> 76 
<212> RNA 

<213> Sulfolobus sol f atari cus 
<400> 51 

agccggguag ucuagugguc aaggauccag ggcuuuggcc ccugggacca ggguucgaau 60 
cccugcccgg cuacca 76 

<210> 52 
<211> 76 
<212> RNA 

<213> Sulfolobus tokodaii 
<400> 52 

agccgggucg ucuagugguc aaggaucgag ggcucuggcc ccucggaccu ggguucaaau 60 
cccagcccgg cuacca 76 

<210> 53 
<211> 76 
<212> RNA 

<213> Sulfolobus tokodaii 
<400> 53 

agccgggucg ucuagugguc aaggaucgag ggcuuuggcc ccucggaccu ggguucaaau 60 
cccagcccgg cuacca 76 

<210> 54 
<211> 76 
<212> RNA 

<213> Thermoplasma acidophil urn 
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<400> 54 

agcccugugg uguaguggcc aagcauuaug ggcucuggac ccaucgacgg caguucgaau 60 
cugcccaggg cuacca 76 

<210> 55 
<211> 76 
<212> RNA 

<213> Thermoplasma acidophil urn 
<400> 55 

agcccugugg uguaguggac aagcauuuug gacuuuggau ccaaagacgg caguucgaau 60 
cugcccaggg cuacca 76 

<210> 56 
<211> 76 
<212> RNA 

<213> Thermoplasma volcanium 

<400> 56 

agcccugugg uguagcggcc aagcauuaug ggcucuggac ccaucgacgg caguucgaau 60 
cugcccaggg cuacca 76 

<210> 57 
<211> 76 
<212> RNA 

<213> Thermoplasma volcanium 
<400> 57 

agcccugugg uguaguggac aagcauuuug gacuuuggau ccaacgacgg caguucgaau 60 
cugcccaggg cuacca 76 

<210> 58 

<211> 76 

<212> RNA 

<213> artificial 

<220> 

<223> consensus tRNA 

<400> 58 

agccccgugg uguagyggcc aagcaugcgg grcuyuggay cccgcgaccg sgguucgaau 60 
ccscgcgggg cuacca 76 

<210> 59 

<211> 76 

<212> RNA 

<213> artificial 

<220> 

<223> mutant tRNA 
<400> 59 

agccccgugg uguagcggcc aagcaugcgg ggcucuaaac cccgcgaccg ggguucgaau 60 
ccccgcgggg cuacca 76 
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<210> 60 
<211> 76 
<212> RNA 
<213> artificial 

<220> 

<223> mutant tRNA 
<400> 60 

agccccgugg uguagcggcc aagcacgcgg ggcucuaaac cccgcgaccg ggguucgaau 
ccccgcgggg cuacca 



<210> 61 

<211> 78 

<212> RNA 

<213> artificial 

<220> 

<223> consensus tRNA 
<400> 61 

gcyccggugg uguagcccgg ccaakcaugc gggccuyucg agcccgcgac ccggguucaa 
aucccggccg grgcacca 



<210> 62 

<211> 76 

<212> RNA 

<213> artificial 

<220> 

<223> consensus tRNA 

<400> 62 

agccccgugg uguagyggcc aagcaugcgg grcuyuggay cccgcgaccg sgguucgaau 
ccscgcgggg cuacca 



<210> 63 

<211> 78 

<212> RNA 

<213> artificial 

<220> 

<223> mutant tRNA 
<400> 63 

gccccggugg uguagcccgg ccaagcaugc gggccucuaa agcccgcgac ccggguucaa 
aucccggccg gggcacca 



<210> 64 

<211> 76 

<212> RNA 

<213> artificial 

<220> 

<223> mutant tRNA 
<400> 64 

agccccgugg uguagcggcc aagcaugcgg ggcucuaaac cccgcgaccg ggguucgaau 
ccccgcgggg cuacca 
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<210> 65 

<211> 76 

<212> RNA 

<213> artificial 

<220> 

<223> mutant tRNA 
<400> 65 

agccccgugg uguagcggcc aagcacgcgg ggcucuaaac cccgcgaccg ggguucgaau 60 
ccccgcgggg cuacca 76 

<210> 66 

<211> 78 

<212> RNA 

<213> artificial 

<220> 

<223> mutant tRNA 
<220> 

<2 2 1> mis c_f eat u re 

<222> (10) . . (10) 

<223> n is a, c, g, or u 

<220> 

<221> mi sc_feature 

<222> (28) . . (28) 

<223> n is a, c, g, or u 

<400> 66 

gccccggugn uguagcccgg ccaagcangc gggccucuaa agcccgcgac ccggguucaa 60 
aucccggccg gggcacca 78 

<210> 67 

<211> 78 

<212> RNA 

<213> artificial 

<220> 

<223> mutant tRNA 
<400> 67 

gccccggugg uguagcccgg ccaagcacgc gggccucuaa agcccgcgac ccggguucaa 60 
aucccggccg gggcacca 78 

<210> 68 
<211> 1656 
<212> DNA 

<213> Archaeoglobus fulgidus 
<400> 68 

gtgaaagaag tcataatgaa atacgtagtt cagaacgccg caaagtacgg aaaagccagt 60 

gaaaaggctg taatggggaa ggtgatggca gaaaatccag aactcagaaa aaaagctaaa 120 

gaggtccttg aactcgtaaa ggagtgcatt accgagttcg aagcgctttc tgaagaagta 180 

aggaaggagc taatcaaaaa atacagcatg gatagcgaag ctaaaaggga gttggagacg 240 
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aAnaaact""tc 


cagagcttga 


aaaQQCtQaQ 

yyyyy*-*"y**sj 


aaaQaaaaaci 


ttqtqatqaq 


attcgctcct 


300 


W&V^ ^ W4 \Jk 


gtccacctac 


actcggttct 


QCtaqqqqqa 

y*- *-«-*yyyyy** 


taatcgtcaa 


cggtgaatac 


360 




acgaagggaa 


gtacattatc 


aqqtttqacq 


atacagaccc 


caqaaccaag 


420 


ciy civ» v.y CA u, 


ttaaaaccta 


caaataatiac 

^y**y 


ctagaggaca 


ttqaatqqct 


cggttacaag 


480 




ttattliacac 


ctcaaaaaoQ 


attcccatct 

*A W W %• ^» W V* W 


actacgatta 


tgcaagaaag 


540 


1" 1" a 1" a ri a a a 




ctacacctac 


■ttctacaatc 

W W W ^4 V« VA W %v 


aaaaaaaat't 

ayyayyay u *- 


caagaagttc 


600 


annnacaat"a 




cccacacaoa 


aaca'tcagcg 


tcgaggatac 


acttgaagtc 


660 




««y V. k wy«A«Ayy 


agactatgag 


aaaqaaaaaa 


tcgttcttag 


aattaagacg 


720 


aatra'tacacc 


acaaggatcc 


aacaataaQQ 


QactQqqtqq 

y **yy57 ^sjsj 


ctttcaggat 


aataaaggaa 


780 


"tcacatcctc 




taaatatgtt 


gtctatccca 


cactcgattt 


tgaatctgct 


840 


at:aaaaaat!C 


acctit'ttaaa 


cataacgcac 


attatcaggg 


gtaaggactt 


aattgattct 


900 




aaaaa'taca't 


ctacoao'ta't 


tttaactaaa 

w w w^*jxa ^ *j y 


tttaccccat 


tacaaagcac 


960 


'-yyyy^'^yyy 


"ttaaaa'tc't't 


caaa'tlicaaa 

\t§ W W W«%y %im 


aagctiatcga 


cttcttcaat 


taagaaggat 


1020 


at't'naaaaaa 


ntaanta'taa 

y ^cmy ^yn 


yyy^ *-yyy** 


gacccaaggc 


tgccaacctt 


gagagccttc 


1080 


a nn a noana a 
<*yy wyy»y<*y 


yo i> L Lyny 


'taaaacca'ta 


aagagcttct 


tcctttcgct 


qqqaqttqqc 

yy33 ^373^ 


1140 


nanaafnam 




'tc'tcaaaaa't 

W W V^%A%A%4 %AW^ W 


ctttiacaclia 

Wi W W W%A^i»^4^* 


agaacaggaa 


aatcatcgat 


1200 


ffifaannr'aa 


a r r at "t a ctl" 

ci^v.y i_ i_cLv> ^ ^ 


tttcatttaa 

^ k w u w w ^ y y 


gggcccgtga 


agattgaaa't 


cgttaacctg 


1260 


rTTinanaana 


aananolrna 

cmyciyy ^\_ycL 


oc'tcccacT.Q 

y V. V. V. v> v>y ^ w \A 


aatccacaca 


caaacaaaaa 

^y y y ^y**y 


qaqaaqqctq 


1320 


aaannlTiaaa 


naactata'ta 


l:at:l:acaaaa 


gacgacttcg 


agaggttgaa 


aqqqcaqqtt 

27 27 ^ ^ 


1380 


gtaaggctga 


aggacttctg 


caacgttctg 


cttgacgaga 


aggcagagft 


ca xggg L L lt 


1 AAfi 


gagcttgagg 


gggtcaagaa 


ggggaagaac 


ataatccact 


ggcttcctga 


gagcgaggct 


1500 


atcaagggca 


aggttatcgg 


cgaaagggaa 


gcggaagggc 


ttgtggagag 


aaacgcagtt 


1560 


agagatgtcg 


ggaaagtcgt 


gcagttcgag 


agatttgctt 


tctgtaaagt 


agagtctgct 


1620 


gacgaggaac 


tggtcgcggt 


gtacacgcac 


ccgtga 






1656 



<210> 69 
<211> 551 
<212> PRT 

<213> Archaeoglobus fu1g1dus 
<400> 69 

val Lys Glu val lie Met Lys Tyr val Val Gin Asn Ala Ala Lys Tyr 
15 10 15 

Gly Lys Ala Ser Glu Lys Ala val Met Gly Lys val Met Ala Glu Asn 
20 25 30 

Pro Glu Leu Arg Lys Lys Ala Lys Glu val Leu Glu Leu val Lys Glu 
35 40 45 
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Cys lie Thr Glu Phe Glu Ala Leu Ser Glu Glu val Arg Lys Glu Leu 
50 55 60 

lie Lys Lys Tyr Ser Met Asp Ser Glu Ala Lys Arg Glu Leu Glu Thr 
65 70 75 80 

Lys Lys Leu Pro Glu Leu Glu Gly Ala Glu Lys Gly Lys val val Met 
85 90 95 

Arg Phe Ala Pro Asn Pro Asn Gly Pro Pro Thr Leu Gly Ser Ala Arg 
100 105 110 

Gly He lie Val Asn Gly Glu Tyr Ala Lys Met Tyr Glu Gly Lys Tyr 
115 120 125 

lie lie Arg Phe Asp Asp Thr Asp Pro Arg Thr Lys Arg Pro Met lie 
130 135 140 

Glu Ala Tyr Glu Trp Tyr Leu Glu Asp lie Glu Trp Leu Gly Tyr Lys 
145 150 155 160 

pro Asp Glu val lie Tyr Ala ser Arg Arg lie Pro lie Tyr Tyr Asp 
165 170 175 

Tyr Ala Arg Lys Leu lie Glu Met Gly Lys Ala Tyr Thr cys Phe Cys 
180 185 190 

ser Gin Glu Glu Phe Lys Lys Phe Arg Asp ser Gly Glu Glu Cys Pro 
195 200 205 

His Arg Asn lie Ser val Glu Asp Thr Leu Glu val Trp Glu Arg Met 
210 215 220 

Leu Glu Gly Asp Tyr Glu Glu Gly Glu val val Leu Arg lie Lys Thr 
225 y ^ ^ 230 235 240 

Asp Met Arg His Lys Asp Pro Ala lie Arg Asp Trp val Ala Phe Arg 
245 250 255 

lie lie Lys Glu Ser His Pro Leu val Gly Asp Lys Tyr val val Tyr 
260 265 270 

Pro Thr Leu Asp Phe Glu ser Ala lie Glu Asp His Leu Leu Gly lie 
275 280 285 

Thr His He He Arg Gly Lys Asp Leu lie Asp ser Glu Arg Arg Gin 
290 295 300 

Arg Tyr lie Tyr Glu Tyr Phe Gly Trp He Tyr Pro He Thr Lys His 
305 310 315 320 
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Trp Gly Arg val 



Lys 
325 



lie Phe Glu Phe Gly Lys Leu Ser Thr ser 
330 335 



Ser 



lie Lys Lys Asp He Glu Arg Gly Lys Tyr Glu Gly Trp Asp Asp Pro 
340 345 350 



Arg Leu Pro Thr Leu Arg Ala Phe Arg Arg Arg Gly Phe Glu Pro Glu 
^ 355 360 365 



Ala lie Lys Ser Phe Phe Leu Ser Leu Gly Val Gly Glu Asn Asp val 
370 375 380 



Ser Val Ser Leu Lys Asn Leu Tyr Ala Glu Asn Arg Lys lie lie Asp 
385 390 395 400 



Arg Lys Ala Asn Arg Tyr Phe Phe lie Trp Gly Pro Val Lys lie Glu 
405 410 415 



lie val Asn Leu Pro Glu Lys Lys Glu val Glu Leu Pro Leu Asn Pro 
420 425 430 



His Thr Gly Glu Lys Arg Arg Leu Lys Gly Glu Arg Thr lie Tyr val 
435 440 445 



Thr Lys Asp Asp Phe Glu Arg Leu Lys Gly Gin Val val Arg Leu Lys 
450 455 460 



Asp Phe Cys Asn val Leu Leu Asp Glu Lys Ala Glu Phe Met Gly Phe 
465 470 475 480 



Glu Leu Glu Gly val Lys Lys Gly Lys Asn lie lie His Trp Leu Pro 
485 490 495 



Glu Ser Glu Ala lie Lys Gly Lys val lie Gly Glu Arg Glu Ala Glu 
500 505 510 



Gly Leu val Glu Arg Asn Ala val Arg Asp val Gly Lys val val Gin 
515 520 525 



Phe Glu Arg Phe Ala Phe Cys Lys val Glu Ser Ala Asp Glu Glu Leu 
530 535 540 



Val Ala val Tyr Thr His Pro 



<210> 70 
<211> 1635 
<212> DNA 

<213> Aeuropyrum pernix 
<400> 70 

atgtcaatgc tcctagggga ccatcctgag cttaggagta gggcgaggga gatagcctcc 



545 



550 



Page 15 



54-000510US . ST2 5 . txt 



ctggctgcaa 


aaQtcQtaaa 

yyy *-^y *"yy** 


Qcaaqtaaac 


agcatgcctg 


ctqqcqaqca 


gaagaggctc 


120 


ctctcaaaac 


agtatccgga 


actaacciaqq 


tttgaagagc 


agcgcgagaa 


aggcgataaa 


180 


ggcc^acccc 


cactacccQQ 


aQCCQtaqaa 


qqcaqqqtca 

3737^**37 3J37 


aqctqaqqtt 


cgcccccaac 


240 


ccagactttg 


tgatacacat 


aaacaacQct 


aggccggcca 


tagtgaacca 


cgagtacgcc 


300 


aggatgta'ta 


aaaaaaaaat 


aatQCtQCQQ 


ttcgaagaca 


cagaccctag 


gacaaagact 


360 


ccccliacaaa 


aggcgtacga 


cctcatccgc 


caggacctca 


aqtqqcttqq 

**37 ^ 3737^ ^ **3737 


cgtttcctgg 


420 


aacaaaoaat: 


acatccaaaa 


cctcaaQatQ 


gaggtcttct 


acagcgtagc 


cagaagggct 


480 




ao'tacacc'ta 


cgncgacaac 


tQtqqqaqaq 

^37 ^*7 37 37** 37** 37 


agggtaagga 


gctgctctcg 


540 




antatccaac 


cagggatctt 


qqccccqaqq 


ataatctgga 


qctqtttqag 


600 


aagatigcttig 


aaggagagtt 


ttacQaqqqq 


qaqqctqttq 

37 3 37 37 **37 


tgagqatgaa 

** 37 37 37 


gacqqacccq 


660 


cgccacccca 


accccagcct 


qaaqqactqq 


gtggccatga 


ggattataga 


cacggagaaa 


720 


cacccccacc 


CCCtCatQQQ 


qaqcaqqtac 


ctqqtqtqqc 


cgacctataa 


cttcgccgtc 


780 


agcgtcgacg 


accacatgat 


ggagataacc 


cacgtgctca 


qqqqtaaqqa 

37373737 ^^•*»3737*^ 


gcaccagctg 


840 


aacaccgaga 


agcagcttgc 


cqtctacaqq 


tqtatqqqct 


ggaggccgcc 


gtacttcatc 


900 


cacntcaaca 


ggctaaagct 


37 57 "'373737 37 


atactgagca 


agagcaagat 


aagaaagctc 


960 


ctcaaaaaaa 


aoccaaaaaa 

\A \m fcJL ^4 ^*^4 


ati'tca'taaac 

^4 ^ w «A w 7^ 7*4 


'tacaacaacc 


CCCQCttCQQ 

^» ^»^4 ^» W W ^4 ^4 


cacgatagcc 


1020 


27 Z3 z3 ^ ^V4Ci\J\4vL 


yyayyyy uy i- 


y^ *-yy v-yyo-y 


Qcca'taaaac 


agataa'tact 


cqaqqtqqqq 

*"37**3737 **37s7sa37 


1080 


Q'tcaaaccca 


caaacactac 


aa'taaac'taa 


gcaaacctigg 


cggccgctaa 


taqqaaqctq 


1140 


\« ^ y y ^ ^ y i7 


aaacaaacao 

yyy^yy"*»**y 


aataatatat 


g'tcgaagacc 


ccgttgagat 


□qaaqtcqaq 

yy"yy »-'«-y»-*y 


1200 




«xyy**y *-y ^**y 


aaccQcaaaa 


ataccgttcc 


acccgagccg 


cccccagagg 


1260 


3303003.003. 


"taacctta"ta 


cacaoaaaac 

\tK Kmm K4 K4 %4 X4 %4 V4i V* 


aaggttctcc 


tcacaaggga 


aqacqcaqta 


1320 




aactaaaQCt 


cataaaccta 


tcaaacttca 


ccgtcagcca 


aggcatattg 


1380 


agggaggtgg 


atcccagcct 


cgagtatgcg 


aggagaatga 


agctacctat 


ag'tacaatgg 




gtgaagaagg 


gcggagaagc 


tagcgtagag 


gtactcgagc 


ccgtcgagct 


ggagctcagg 


1500 


aggcaccagg 


gctacgctga 


ggacgctatt 


aggggctatg 


gggttgacag 


ccgcctacag 


1560 


tttgtaaggt 


acggttttgt 


gagggtggac 


agcgtggaag 


acggggtcta 


cagggtgata 


1620 


tacacacaca 


agtag 










1635 



<210> 71 
<211> 544 
<212> PRT 

<213> Aeuropyrum pernix 
<400> 71 

Met Ser Met Leu Leu Gly Asp His Pro Glu Leu Arg Ser Arg Ala Arg 
15 10 15 



Glu lie Ala ser Leu Ala Ala Arg val val Glu Gin val Asn ser Met 
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20 25 30 

Pro Ala Gly Glu Gin Lys Arg Leu Leu Ser Glu Gin Tyr Pro Glu Leu 
35 ^ 40 45 

Ala Arg Phe Glu Glu Gin Arg Glu Lys Gly Asp Lys Gly Leu Pro Pro 
50 55 60 

Leu Pro Gly Ala val Glu Gly Arg val Lys Leu Arg Phe Ala Pro Asn 
65 70 75 80 

Pro Asp Phe val lie His Met Gly Asn Ala Arg Pro Ala lie val Asn 
85 90 95 

His Glu Tyr Ala Arg Met Tyr Lys Gly Arg Met val Leu Arg Phe Glu 
100 105 110 

Asp Thr Asp Pro Arg Thr Lys Thr Pro Leu Arg Glu Ala Tyr Asp Leu 
115 « ^ -L20 125 

lie Arg Gin Asp Leu Lys Trp Leu Gly Val Ser Trp Asp Glu Glu Tyr 
130 135 140 

lie Gin ser Leu Arg Met Glu val Phe Tyr ser val Ala Arg Arg Ala 
145 150 155 160 

lie Glu Arg Gly Cys Ala Tyr val Asp Asn cys Gly Arg Glu Gly Lys 
165 170 175 

Glu Leu Leu Ser Arg Gly Glu Tyr Cys Pro Thr Arg Asp Leu Gly Pro 
180 185 190 

Glu Asp Asn Leu Glu Leu Phe Glu Lys Met Leu Glu Gly Glu Phe Tyr 
195 200 205 

Glu Gly Glu Ala val val Arg Met Lys Thr Asp Pro Arg His Pro Asn 
210 215 220 

Pro Ser Leu Arg Asp Trp val Ala Met Arg lie lie Asp Thr Glu Lys 
225 230 235 240 

His Pro His Pro Leu val Gly Ser Arg Tyr Leu val Trp Pro Thr Tyr 
245 250 255 

Asn Phe Ala val ser val Asp Asp His Met Met Glu lie Thr His val 
260 265 270 

Leu Arg Gly Lys Glu His Gin Leu Asn Thr Glu Lys Gin Leu Ala val 
275 280 285 

Tyr Arg Cys Met Gly Trp Arg Pro Pro Tyr Phe lie His Phe Gly Arg 
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290 295 300 

Leu Lys Leu Glu Gly Phe lie Leu Ser Lys Ser Lys lie Arg Lys Leu 
305 310 315 320 

Leu Glu Glu Arg Pro Gly Glu Phe Met Gly Tyr Asp Asp Pro Arg Phe 
325 330 335 

Gly Thr lie Ala Gly Leu Arg Arg Arg Gly val Leu Ala Glu Ala lie 
340 345 350 

Arg Gin lie lie Leu Glu val Gly val Lys Pro Thr Asp Ala Thr lie 
355 360 365 

ser Trp Ala Asn Leu Ala Ala Ala Asn Arg Lys Leu Leu Asp Glu Arg 
370 375 380 

Ala Asp Arg lie Met Tyr val Glu Asp Pro val Glu Met Glu Val Glu 
385 390 395 400 

Leu Ala Gin Val Glu Cys Arg Ala Ala Glu lie Pro Phe His Pro ser 
405 410 415 

Arg Pro Gin Arg Lys Arg Arg lie Thr Leu Cys Thr Gly Asp Lys val 
420 425 430 

Leu Leu Thr Arg Glu Asp Ala val Glu Gly Arg Gin Leu Arg Leu Met 
435 440 445 

Gly Leu Ser Asn Phe Thr Val Ser Gin Gly lie Leu Arg Glu val Asp 
450 455 460 

Pro ser Leu Glu Tyr Ala Arg Arg Met Lys Leu Pro lie val Gin Trp 
465 470 475 480 

Val Lys Lys Gly Gly Glu Ala ser val Glu val Leu Glu Pro val Glu 
485 490 495 

Leu Glu Leu Arg Arg His Gin Gly Tyr Ala Glu Asp Ala lie Arg Gly 
500 505 510 

Tyr Gly Val Asp Ser Arg Leu Gin Phe val Arg Tyr Gly Phe val Arg 
515 520 525 

val Asp Ser Val Glu Asp Gly val Tyr Arg val lie Tyr Thr His Lys 
530 535 540 

<210> 72 
<211> 1716 
<212> DNA 

<213> Methanosarcina mazeii 
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<400> 72 
atgaccttaa 


gtcctgaaga 


tattaaaata 


attgaaaaat 


acgctcttca 


aaatgctgta 


60 


aaatacggaa 


aagccccaca 


gcccaaagct 


gttatgggta 


aagttatggg 


ggaatgtccc 


120 


cagttgaggg 


ctgaccccag 


cggggtatct 


gctgcccttg 


aaaatatagt 


ttcagaaatt 


180 


gctaaaggaa 


accctgaagc 


ctgggaagcc 


cgcctgtcgg 


aaatcgctcc 


cgagcttata 


240 


gaggctctga 


gtgtaaagaa 


agagcctgac 


aaaggcttaa 


aacctcttga 


gggagccgaa 


300 


accggtaagg 


ttgtaatgcg 


ctttgcccca 


aatcccaacg 


gacccgcaac 


cctgggaagt 


360 


tcaaggggaa 


tggttgtcaa 


ttccgagtat 


gtaaaagtgt 


ataaaggaaa 


gtttatcctg 


420 


cgctttgatg 


acactgaccc 


tgatataaag 


cgtcccatgc 


tcgaagccta 


tgactggtac 


480 


atggacgact 


ttaaatggct 


cggggttgtg 


cctgaccagg 


ttgtacgtgc 


ctctgaccgt 


540 


ttcccgattt 


attacgatta 


tgcaagaaaa 


ctgattgaga 


tgggaaaagc 


ctatgtctgt 


600 


ttctgtaaag 


gagaagactt 


caaaaggtta 


aaagactcca 


aaaaggcctg 


ccctcacagg 


660 


gacacagacc 


ctgaagaaaa 


ccttatgcac 


tgggaaaaga 


tgcttgccgg 


agaatatgaa 


720 


gaccagcagg 


ctgttctgcg 


cataaagacc 


gatatagagc 


acaaagaccc 


tgcacttcgg 


780 


gactggggag 


cattcaggat 


tagaaaaatg 


tcccacccac 


gccctgagat 


tggaaataaa 


840 


tacatcgtct 


ggccccttct 


ggactttgca 


ggtgctatcg 


aagaccacga 


gctcggtatg 


900 


acccacatca 


tccggggtaa 


agaccttatc 


gacagtgaaa 


aacggcagac 


atacatttac 


960 


aaatacttcg 


gctggacata 


cccgaaaaca 


actcactggg 


gcagggtaaa 


gatccatgag 


1020 


ttcggaaagt 


tcagcacaag 


taccctgcgt 


aaagctatcg 


aatccggaga 


atacagcggc 


1080 


tgggacgacc 


cccgcctgcc 


gacaataagg 


gcaatcaggc 


gccgtggaat 


tagggctgaa 


1140 


gctctcaaaa 


agttcatgat 


agaaatggga 


gtcgggatga 


ccgatgtgag 


catcagcatg 


1200 


gaatctcttt 


atgccgagaa 


ccgcaaaatc 


gtggacccgg 


ttgcaaaccg 


ctatttcttt 


1260 


gtctggaacc 


ctgtggagct 


tgaaattgaa 


ggtatgaaac 


ctgtggtcgc 


aaaggtcccc 


1320 


cgtcacccaa 


cagaccatgc 


aagaggaatg 


agggaaatct 


ctatcgaaaa 


taaggttctt 


1380 


gtctgtgccg 


aggacattga 


aaagctcaat 


gtaggttctg 


tcctccgctt 


aaaagatctc 


1440 


tgcaacgttg 


aaattacttc 


cctttccccg 


ctgcgggtta 


agcgctccga 


aacctctctc 


1500 


gaagacctca 


aaaaagcaaa 


aggaaaaatc 


atccactggg 


tgccggttga 


tggaattcct 


1560 


gtgaaagtct 


gcgggccaga 


gggcgatatc 


gaagggacag 


gagaacgcgg 


gatcgagacc 


1620 


gagcttgata 


atattgtcca 


gtttgaacgc 


ttcggtttct 


gcaggatcga 


cgccgtagac 


1680 


ggagaaaatg 


tcgtggcgta 


ctttgctcac 


aaatga 






1716 



<210> 73 
<211> 571 
<212> PRT 

<213> Methanosarcina mazeii 
<400> 73 

Met Thr Leu ser Pro Glu Asp lie Lys lie lie Glu Lys Tyr Ala Leu 
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15 10 15 

Gin Asn Ala Val Lys Tyr Gly Lys Ala Pro Gin Pro Lys Ala val Met 
20 25 30 

Gly Lys val Met Gly Glu Cys Pro Gin Leu Arg Ala Asp Pro Ser Gly 
35 40 45 

Val ser Ala Ala Leu Glu Asn He val ser Glu lie Ala Lys Gly Asn 
50 55 60 

Pro Glu Ala Trp Glu Ala Arg Leu Ser Glu lie Ala Pro Glu Leu lie 
65 70 75 80 

Glu Ala Leu Ser val Lys Lys Glu Pro Asp Lys Gly Leu Lys Pro Leu 
85 90 95 

Glu Gly Ala Glu Thr Gly Lys val val Met Arg Phe Ala Pro Asn Pro 
100 105 110 

Asn Gly Pro Ala Thr Leu Gly Ser ser Arg Gly Met val val Asn Ser 
115 120 125 

Glu Tyr val Lys val Tyr Lys Gly Lys Phe lie Leu Arg Phe Asp Asp 
130 135 140 

Thr Asp Pro Asp lie Lys Arg Pro Met Leu Glu Ala Tyr Asp Trp Tyr 
145 150 155 160 

Met Asp Asp Phe Lys Trp Leu Gly val val Pro Asp Gin val val Arg 
165 170 175 

Ala Ser Asp Arg Phe Pro lie Tyr Tyr Asp Tyr Ala Arg Lys Leu lie 
180 185 190 

Glu Met Gly Lys Ala Tyr val cys Phe Cys Lys Gly Glu Asp Phe Lys 
195 200 205 

Arg Leu Lys Asp Ser Lys Lys Ala Cys Pro His Arg Asp Thr Asp Pro 
210 215 220 

Glu Glu Asn Leu Met His Trp Glu Lys Met Leu Ala Gly Glu Tyr Glu 
225 230 235 240 

Asp Gin Gin Ala val Leu Arg He Lys Thr Asp lie Glu His Lys Asp 
245 250 255 

Pro Ala Leu Arg Asp Trp Gly Ala Phe Arg lie Arg Lys Met Ser His 
260 265 270 

Pro Arg Pro Glu lie Gly Asn Lys Tyr lie val Trp Pro Leu Leu Asp 
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275 280 285 

Phe Ala Gly Ala He Glu Asp His Glu Leu Gly Met Thr His He He 
290 295 300 

Arg Gly Lys Asp Leu lie Asp Ser Glu Lys Arg Gin Thr Tyr lie Tyr 
305 310 315 320 

Lys Tyr Phe Gly Trp Thr Tyr Pro Lys Thr Thr His Trp Gly Arg val 
325 330 335 

Lys lie His Glu Phe Gly Lys Phe Ser Thr Ser Thr Leu Arg Lys Ala 
340 345 350 

lie Glu ser Gly Glu Tyr Ser Gly Trp Asp Asp Pro Arg Leu Pro Thr 
355 360 365 

lie Arg Ala lie Arg Arg Arg Gly lie Arg Ala Glu Ala Leu Lys Lys 
370 375 380 

Phe Met lie Glu Met Gly val Gly Met Thr Asp val ser lie ser Met 
385 390 395 400 

Glu ser Leu Tyr Ala Glu Asn Arg Lys lie val Asp Pro val Ala Asn 
405 410 415 

Arg Tyr Phe Phe val Trp Asn Pro val Glu Leu Glu lie Glu Gly Met 
420 425 430 

Lys Pro val val Ala Lys val Pro Arg His Pro Thr Asp His Ala Arg 
435 440 445 

Gly Met Arg Glu lie Ser lie Glu Asn Lys val Leu val cys Ala Glu 
450 455 460 

Asp lie Glu Lys Leu Asn val Gly Ser val Leu Arg Leu Lys Asp Leu 
465 470 475 480 

Cys Asn val Glu lie Thr Ser Leu Ser Pro Leu Arg val Lys Arg Ser 
485 490 495 

Glu Thr Ser Leu Glu Asp Leu Lys Lys Ala Lys Gly Lys lie lie His 
500 505 510 

Trp val Pro val Asp Gly lie Pro val Lys val Cys Gly Pro Glu Gly 
515 520 525 

Asp lie Glu Gly Thr Gly Glu Arg Gly lie Glu Thr Glu Leu Asp Asn 
530 535 540 

He val Gin Phe Glu Arg Phe Gly Phe Cys Arg He Asp Ala val Asp 

Page 21 



545 



54-000510US . ST25 . txt 
555 



Gly Glu Asn val val Ala Tyr Phe Ala His Lys 
565 570 

<210> 74 
<211> 1662 
<212> DNA 

<213> Methanobacterium thermoautotrophicum 



gtggtgccag 


tggaggacct 


tgtctacagg 


tacgcactcc 


tgaacgctgt 


gaaacacagg 


60 


ggaagggcaa 


acccgggggc 


cgtcatgggg 


gccgtcatga 


gcaacgaacc 


tgaactcagg 


120 


aagatggcgc 


cccaggttaa 


ggaagcagtt 


gaggccgctg 


ttgagagggt 


taacagttta 


180 


agccccgagg 


agcagcagca 


ggagatggag 


aggctgggcc 


ttgagatcac 


cgagaggaaa 


240 


cagaagaagc 


ggaagggtct 


aagggaactg 


gccggggtga 


agggtgaggt 


ggtcctcagg 


300 


ttcgccccca 


accccagcgg 


acccctccac 


ataggccatg 


caagggccgc 


gatcctcaac 


360 


catgaatatg 


caaggaaata 


tgacggcagg 


ctcatcctca 


ggatagagga 


cacggacccc 


420 


cgcagggttg 


acccggaggc 


ctacgatatg 


attccagccg 


accttgagtg 


gctgggcgtt 


480 


gaatgggatg 


agacagttat 


ccagagcgac 


cgcatggaaa 


cctactatga 


gtacacagag 


540 


aaactcatag 


agaggggtgg 


tgcgtatgta 


tgcacatgca 


ggccggagga 


gttcagggaa 


600 


ctcaagaaca 


ggggagaggc 


ctgtcactgc 


aggtcccttg 


gtttcaggga 


gaacctccag 


660 


cgatggaggg 


aaatgttcga 


gatgaaggag 


ggctcagccg 


ttgtgagggt 


taaaacggac 


720 


ctgaaccacc 


caaaccctgc 


cataagagac 


tgggtctcaa 


tgaggatcgt 


tgaggcagag 


780 


cacccacgca 


ccggtacacg 


ctacagggtc 


taccccatga 


tgaacttctc 


agtggcggtt 


840 


gatgaccacc 


tccttggcgt 


gacacacgtc 


ctgaggggta 


aggaccacct 


ggcaaacaga 


900 


gagaagcagg 


agtacctcta 


caggcacctt 


ggctgggagc 


cccccgaatt 


catacactac 


960 


gggcgcctga 


agatggacga 


cgttgcactc 


agcacctcgg 


gggcccgtga 


gggcatcctc 


1020 


aggggtgagt 


attctggatg 


ggacgaccca 


cgcctcggaa 


ccctgagggc 


cattgcaagg 


1080 


aggggtatac 


gaccggaggc 


cataagaaag 


ttaatggttg 


aaatcggcgt 


aaagatagca 


1140 


gattccacaa 


tgagctggaa 


gaagatctac 


ggcctcaaca 


ggagcatcct 


cgaagaggag 


1200 


gccaggaggt 


acttctttgc 


tgcagatcct 


gttaaacttg 


aagtggtggg 


tttacccgga 


1260 


cctgtcaggg 


ttgaaaggcc 


cctacacccg 


gaccaccctg 


aaatcgggaa 


cagggtactt 


1320 


gaactgaggg 


gtgaggtgta 


cctgccgggc 


gatgaccttg 


gggaaggacc 


gctgcgcctc 


1380 


atagacgcgg 


ttaacgtgat 


ctactctggg 


ggtgaactga 


ggtaccacag 


tgagggcatc 


1440 


gaggaggccc 


gtgaactcgg 


cgcttcaatg 


atacactggg 


tccctgcaga 


gtccgccctt 


1500 


gaggccgagg 


tcatcatgcc 


ggacgcctcc 


agggtgaggg 


gagtcatcga 


ggctgacgca 


1560 


tcagaactgg 


aggttgatga 


tgttgtgcag 


cttgagaggt 


tcggcttcgc 


ccgcctggat 


1620 


tctgcaggcc 


caggaatggt 


cttctactat 


gcccacaaat ga 
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<210> 75 
<211> 553 
<212> PRT 

<213> Methanobacterium thermoautotrophicum 
<400> 75 

val val Pro val Glu Asp Leu val Tyr Arg Tyr Ala Leu Leu Asn Ala 
15 10 15 

val Lys His Arg Gly Arg Ala Asn Pro Gly Ala val Met Gly Ala val 
20 25 30 

Met Ser Asn Glu Pro Glu Leu Arg Lys Met Ala Pro Gin val Lys Glu 
35 40 45 

Ala Val Glu Ala Ala val Glu Arg val Asn ser Leu Ser Pro Glu Glu 
50 55 60 

Gin Gin Gin Glu Met Glu Arg Leu Gly Leu Glu lie Thr Glu Arg Lys 
65 70 75 80 

Gin Lys Lys Arg Lys Gly Leu Arg Glu Leu Ala Gly val Lys Gly Glu 
85 90 95 

val val Leu Arg Phe Ala Pro Asn Pro Ser Gly Pro Leu His lie Gly 
100 105 110 

His Ala Arg Ala Ala lie Leu Asn His Glu Tyr Ala Arg Lys Tyr Asp 
115 120 125 

Gly Arg Leu lie Leu Arg lie Glu Asp Thr Asp Pro Arg Arg val Asp 
130 135 140 

Pro Glu Ala Tyr Asp Met lie Pro Ala Asp Leu Glu Trp Leu Gly val 
145 150 155 160 

Glu Trp Asp Glu Thr val lie Gin ser Asp Arg Met Glu Thr Tyr Tyr 
165 170 175 

Glu Tyr Thr Glu Lys Leu lie Glu Arg Gly Gly Ala Tyr val Cys Thr 
180 185 190 

Cys Arg Pro Glu Glu Phe Arg Glu Leu Lys Asn Arg Gly Glu Ala Cys 
195 200 205 

His Cys Arg ser Leu Gly Phe Arg Glu Asn Leu Gin Arg Trp Arg Glu 
210 215 220 

Met Phe Glu Met Lys Glu Gly ser Ala val val Arg val Lys Thr Asp 
225 230 235 240 
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Leu Asn His Pro Asn Pro Ala lie Arg Asp Trp val Ser Met Arg lie 
245 250 255 

val Glu Ala Glu His Pro Arg Thr Gly Thr Arg Tyr Arg val Tyr Pro 
260 265 270 

Met Met Asn Phe Ser val Ala val Asp Asp His Leu Leu Gly val Thr 
275 280 285 

His Val Leu Arg Gly Lys Asp His Leu Ala Asn Arg Glu Lys Gin Glu 
290 ^ ^ ^ 295 300 

Tyr Leu Tyr Arg His Leu Gly Trp Glu Pro Pro Glu Phe lie His Tyr 
305 y ^ 310 315 320 

Gly Arg Leu Lys Met Asp Asp Val Ala Leu ser Thr Ser Gly Ala Arg 
325 330 335 

Glu Gly lie Leu Arg Gly Glu Tyr ser Gly Trp Asp Asp Pro Arg Leu 
340 345 350 

Gly Thr Leu Arg Ala lie Ala Arg Arg Gly lie Arg Pro Glu Ala lie 
355 360 365 

Arg Lys Leu Met val Glu lie Gly val Lys lie Ala Asp Ser Thr Met 
370 375 380 

Ser Trp Lys Lys lie Tyr Gly Leu Asn Arg ser lie Leu Glu Glu Glu 
385 390 395 400 

Ala Arg Arg Tyr Phe Phe Ala Ala Asp Pro val Lys Leu Glu val val 
405 410 415 

Gly Leu Pro Gly Pro Val Arg val Glu Arg Pro Leu His Pro Asp His 
420 425 430 

Pro Glu lie Gly Asn Arg Val Leu Glu Leu Arg Gly Glu val Tyr Leu 
435 440 445 

Pro Gly Asp Asp Leu Gly Glu Gly Pro Leu Arg Leu lie Asp Ala Val 
450 455 460 

Asn val lie Tyr Ser Gly Gly Glu Leu Arg Tyr His Ser Glu Gly lie 
465 470 475 480 

Glu Glu Ala Arg Glu Leu Gly Ala Ser Met lie His Trp Val Pro Ala 
485 490 495 

Glu ser Ala Leu Glu Ala Glu val lie Met Pro Asp Ala ser Arg val 
500 505 510 
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Arg Gly val lie Glu Ala Asp Ala Ser Glu Leu Glu Val Asp Asp val 
515 520 525 

val Gin Leu Glu Arg Phe Gly Phe Ala Arg Leu Asp ser Ala Gly Pro 
530 535 540 

Gly Met Val Phe Tyr Tyr Ala His Lys 
545 550 

<210> 76 

<211> 1713 

<212> DNA 

<213> Pyrococcus horikoshii 

<400> 76 



ataaatataa 


aaaagatagc 


tctcaaacat 


gcccttatta 


atgcaattga 


Qcatqgaggt 


60 


aaggctaatc 


ttaaggccgt 


catcggtaaa 


□tactCQQcq 


agaaccctga 


gcttaggccc 


120 


aaaaccaaaa 


agataattcc 


cataattaat 


aaQQtaQtQQ 


aagaggtcaa 


ctcactagca 


180 


agggatgagc 


agctggaaaa 


QCttaaQQat 


atatatccag 


agtactttga 


aaagaaggaa 


240 


aaaaaaaaaa 


aaaagaaggg 


actacctctt 


cttcctaaag 


cagagaaagg 


caaggtcgtt 


300 


act cacti! "to 


ctccaaatcc 


agatggtgct 


tttcacctaa 


gtaatgcaag 


ggccgcaata 


360 


ct'taat'ta'ta 


aatacaccaa 


gatgtacggt 


gggaagttta 


tactccgctt 


tgatgatacc 


420 


aatccaaaaa 


'tcaaaaaacc 


tgaacccata 


ttctacaaga 


tgataattga 


agatctagag 


480 




taaaaccaaa 


tgaaatagtg 


tacgctagtg 


atagactcga 


aatttactac 


540 


aaatacacta 


aaaaactaa't 


aaaaatgggt 


aaagcatatg 


tctgcacttg 


tcctccagag 


600 


aaat:t:caaaa 


aocttiaaaaa 


taagggaatli 


ccatgtccgc 


atagagatga 


qcctqtqqaa 


660 


gttcagctcg 


agcgctggaa 


gaagaug L La 


aacggcgaa L 


aLaaagaagg 


dy ddy V. L.y l l 


770 


gtaaggataa 


agactgactt 


aaatcaccca 


aatcctgcgg 


ttagggattg 


gcccgcactt 


780 


agaataattg 


ataacccaaa 


ccatccgaga 


accggaaata 


agtacagggt 


atggcctctc 


840 


tacaattttg 


cctcagcgat 


tgatgatcac 


gagcttggag 


ttacccatat 


cttccgtgga 


900 


caggagcacg 


ccgagaacga 


gactagacag 


aggtatatct 


acgagtactt 


tggttgggag 


960 


tatcccgtca 


cgatccatca 


cgggaggctt 


agcatagagg 


gagttgtttt 


aagcaaatcc 


1020 


aagacaagga 


aaggaataga 


agaaggtaag 


tatctcggtt 


gggatgatcc 


taggcttgga 


1080 


acgataagag 


ctttgaggag 


gaggggaatt 


cttcctgaag 


ccataaaaga 


attgatcata 


1140 


gaagttggat 


tgaagaaaag 


tgacgcgacg 


atcagttggg 


agaatctggc 


agcgataaac 


1200 


agaaaacttg 


ttgatccaat 


tgccaataga 


tatttctttg 


ttgcagatcc 


aattccgatg 


1260 


gaagtcgaag 


gggctcctga 


atttatagca 


gagataccgc 


ttcatcctga 


ccatcctgaa 


1320 


aggggcgtta 


gaaggcttaa 


gttcactcca 


gaaagacctg 


tttacgtttc 


gaaagacgat 


1380 


ttgaatctct 


taaaaccagg 


caattttgtg 


aggttaaagg 


atctcttcaa 


tgttgaaata 


1440 


cttgaagttg 


gagacaaaat 


aagagctagg 


ttctacagct 


ttgaatatga 


aattgcaaag 


1500 
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aagaacaggt ggaagatggt tcactgggtt accgaaggta gaccctgtga ggttataatt 1560 

cctgaaggag acgagcttgt agtgaggaag ggattgctcg agaaggatgc caaagttcaa 1620 

gttaacgaaa tagtccagtt tgaacgtttc ggttttgtaa ggatagacag aattgagggg 1680 

gataaagtta tcgcaatata tgctcacaag taa 1713 

<210> 77 
<211> 570 
<212> PRT 

<213> Pyrococcus horikoshii 
<400> 77 

Met Asp val Glu Lys lie Ala Leu Lys His Ala Leu lie Asn Ala lie 
15 10 15 

Glu His Gly Gly Lys Ala Asn Leu Lys Ala val lie Gly Lys val Leu 
20 25 30 

Gly Glu Asn Pro Glu Leu Arg Pro Arg Ala Lys Glu lie lie Pro lie 
35 40 45 

lie Asn Lys val val Glu Glu val Asn Ser Leu Ala Arg Asp Glu Gin 
50 55 60 

Leu Glu Lys Leu Lys Asp lie Tyr Pro Glu Tyr Phe Glu Lys Lys Glu 
65 70 75 80 

Glu Lys Lys Glu Lys Lys Gly Leu Pro Leu Leu Pro Lys Ala Glu Lys 
85 90 95 

Gly Lys val Val Thr Arg Phe Ala Pro Asn Pro Asp Gly Ala Phe His 
100 105 110 

Leu Gly Asn Ala Arg Ala Ala lie Leu Ser Tyr Glu Tyr Ala Lys Met 
115 120 125 

Tyr Gly Gly Lys Phe lie Leu Arg Phe Asp Asp Thr Asp Pro Lys val 
130 135 140 

Lys Arg Pro Glu Pro lie Phe Tyr Lys Met lie lie Glu Asp Leu Glu 
145 150 155 160 

Trp Leu Gly lie Lys Pro Asp Glu lie val Tyr Ala Ser Asp Arg Leu 
165 170 175 

Glu lie Tyr Tyr Lys Tyr Ala Glu Glu Leu lie Lys Met Gly Lys Ala 
180 185 190 

Tyr val Cys Thr Cys Pro Pro Glu Lys Phe Arg Glu Leu Arg Asp Lys 
195 200 205 
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Gly lie Pro Cys Pro His Arg Asp Glu Pro Val Glu val Gin Leu Glu 
210 215 220 

Arg Trp Lys Lys Met Leu Asn Gly Glu Tyr Lys Glu Gly Glu Ala val 
225 230 235 240 

val Arg lie Lys Thr Asp Leu Asn His Pro Asn Pro Ala val Arg Asp 
245 250 255 

Trp Pro Ala Leu Arg lie lie Asp Asn Pro Asn His Pro Arg Thr Gly 
260 265 270 

Asn Lys Tyr Arg val Trp Pro Leu Tyr Asn Phe Ala Ser Ala lie Asp 
275 280 285 

Asp His Glu Leu Gly val Thr His lie Phe Arg Gly Gin Glu His Ala 
290 295 300 

Glu Asn Glu Thr Arg Gin Arg Tyr lie Tyr Glu Tyr Phe Gly Trp Glu 
305 310 315 320 

Tyr Pro val Thr lie His His Gly Arg Leu ser lie Glu Gly val val 
325 330 335 

Leu ser Lys Ser Lys Thr Arg Lys Gly lie Glu Glu Gly Lys Tyr Leu 
340 345 350 

Gly Trp Asp Asp Pro Arg Leu Gly Thr lie Arg Ala Leu Arg Arg Arg 
355 360 365 

Gly lie Leu Pro Glu Ala lie Lys Glu Leu lie lie Glu val Gly Leu 
370 375 380 

Lys Lys ser Asp Ala Thr lie Ser Trp Glu Asn Leu Ala Ala lie Asn 
385 390 395 400 

Arg Lys Leu val Asp Pro lie Ala Asn Arg Tyr Phe Phe val Ala Asp 
405 410 415 

Pro lie Pro Met Glu Val Glu Gly Ala Pro Glu Phe lie Ala Glu lie 
420 425 430 

Pro Leu His Pro Asp His Pro Glu Arg Gly val Arg Arg Leu Lys Phe 
435 440 445 

Thr Pro Glu Arg Pro val Tyr val ser Lys Asp Asp Leu Asn Leu Leu 
450 455 460 

Lys Pro Gly Asn Phe val Arg Leu Lys Asp Leu Phe Asn val Glu lie 
465 470 475 480 
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Leu Glu val Gly Asp Lys lie Arg Ala Arg Phe Tyr ser Phe Glu Tyr 
485 490 495 

Glu lie Ala Lys Lys Asn Arg Trp Lys Met val His Trp val Thr Glu 



Arg Lys Gly Leu Leu Glu Lys Asp Ala Lys val Gin val Asn Glu lie 
530 535 540 

Val Gin Phe Glu Arg Phe Gly Phe val Arg lie Asp Arg lie Glu Gly 
545 550 555 560 

Asp Lys val lie Ala lie Tyr Ala His Lys 
565 570 

<210> 78 
<211> 1728 
<212> DNA 

<213> sulfolobus sol f atari cus 
<400> 78 

ttgatggaat taaatgaatt acgagaaact atttataaat atgctcttca aaatgcggta 60 

aagcataatg ggaaagccga aacgggtcca gtgatgagta agattatagc ggagaggccg 120 

gaattaagat caaatgctag agaaattgta aagattatca aggagatcat agaacaagtt 180 

aactcattaa ctttagagca gcagttaact gaaattaaag caaagtatcc agagttatta 240 

gaagagaaga agcatgaaga aaaaagaaaa gtgttgccac ctcttcctaa tgttaaagga 300 

caagtagtaa ctagatttgc ccctaatccg gatggtcctc ttcatttagg taacgcaagg 360 

tctgcgattc tatcttacga atacgctaaa atgtataacg gtaagttcat actgaggttt 420 

gatgatacag atcctaaggt taaaaggcca attctagatg catatgattg gattaaggag 480 

gatctaaaat ggctgggaat taaatgggaa caagaacttt acgcttcgga aaggctcgag 540 

ttatactata agtacgctcg gtatttgata gaaaaaggat atgcgtatgt tgatacatgt 600 

gatagtagta ttttcaggaa gtttagggat agtagaggta aaatgaaaga gccagaatgt 660 

ttacatagga gctcttcacc agagagtaat ctggagctgt ttgaaaaaat gctaggagga 720 

aagtttaaag aaggagaagc tgtggttagg ttaaagaccg atctatcgga tccagatcca 780 

tctcaaatag attgggtaat gcttagaata attgatactg ctaaaaatcc tcatccacga 840 

gtagggagca agtattgggt atggcctacc tataactttg cttctataat agatgatcat 900 

gaacttggaa ttactcatgt actgagagct aaggagcata tgtcaaatac agaaaagcag 960 

agatacattt ctgagtacat gggatgggaa tttccagagg tcttacagtt tggaagacta 1020 

agacttgaag gttttatgat gagtaagtcg aaaataagag gaatgttaga gaagggtacc 1080 

aatagggatg atcctagatt gccaacttta gctggactca gaagaagagg aatcttacca 1140 
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gatacaatta aagacgtaat aattgatgtc ggagtaaagg taactgatgc tactataagt 1200 

tttgagaaca ttgccgcaat taataggaaa aaactagatc cggtagctaa aagaataatg 1260 

tttgtaaaag atgcagaaga atttagcgtc gagctacctg aatcactaaa tgctaaaata 1320 

cctttaattc cctctaaaca agaaatgaat agaactatta ttgtcaatcc aggagataaa 1380 

attttgatag aatctaacga tgcagaagat aatagtatac taaggttaat ggaattatgc 1440 

aatgttaaag ttgataagca taatcgtaag ctcatatttc acagcaaaac gttagacgag 1500 

gctaagaaag tcaatgcaaa gatagttcaa tgggtaaaat caaatgaaaa agtaccagta 1560 

atggtagaga aagcggaaag agatgagata aaaatgataa atggttatgc tgaaaaaata 1620 

gcagcagatt tggagataga cgaaattgta caattttacc gatttggatt tgtaagagta 1680 

gataggaaag atgagaatat gctacgtgtt gtattctcac acgactga 1728 

<210> 79 
<211> 575 
<212> PRT 

<213> sulf Globus sol f atari cus 
<400> 79 

Leu Met Glu Leu Asn Glu Leu Arg Glu Thr lie Tyr Lys Tyr Ala Leu 
15 10 15 

Gin Asn Ala val Lys His Asn Gly Lys Ala Glu Thr Gly Pro val Met 
20 25 30 

Ser Lys lie lie Ala Glu Arg Pro Glu Leu Arg Ser Asn Ala Arg Glu 
35 40 45 

lie val Lys lie lie Lys Glu lie lie Glu Gin val Asn ser Leu Thr 
50 55 60 

Leu Glu Gin Gin Leu Thr Glu lie Lys Ala Lys Tyr Pro Glu Leu Leu 
65 70 75 80 

Glu Glu Lys Lys His Glu Glu Lys Arg Lys val Leu Pro Pro Leu Pro 
85 90 95 

Asn val Lys Gly Gin val val Thr Arg Phe Ala Pro Asn Pro Asp Gly 
100 105 110 

Pro Leu His Leu Gly Asn Ala Arg Ser Ala He Leu ser Tyr Glu Tyr 
115 120 125 

Ala Lys Met Tyr Asn Gly Lys Phe lie Leu Arg Phe Asp Asp Thr Asp 
130 135 140 

Pro Lys val Lys Arg Pro lie Leu Asp Ala Tyr Asp Trp lie Lys Glu 
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Asp Leu Lys Trp Leu Gly lie Lys Trp Glu Gin Glu Leu Tyr Ala Ser 
165 170 175 

Glu Arg Leu Glu Leu Tyr Tyr Lys Tyr Ala Arg Tyr Leu He Glu Lys 
180 185 190 

Gly Tyr Ala Tyr val Asp Thr Cys Asp Ser Ser lie Phe Arg Lys Rhe 
195 200 205 

Arg Asp Ser Arg Gly Lys Met Lys Glu Pro Glu Cys Leu His Arg Ser 
210 215 220 

ser ser Pro Glu Ser Asn Leu Glu Leu Phe Glu Lys Met Leu Gly Gly 
225 230 235 240 

Lys Phe Lys Glu Gly Glu Ala Val Val Arg Leu Lys Thr Asp Leu Ser 
245 250 255 

Asp Pro Asp Pro ser Gin lie Asp Trp Val Met Leu Arg lie lie Asp 
260 265 270 

Thr Ala Lys Asn Pro His Pro Arg val Gly Ser Lys Tyr Trp val Trp 
275 280 285 

Pro Thr Tyr Asn Phe Ala Ser lie lie Asp Asp His Glu Leu Gly lie 
290 295 300 

Thr His val Leu Arg Ala Lys Glu His Met ser Asn Thr Glu Lys Gin 
305 310 315 320 

Arg Tyr lie Ser Glu Tyr Met Gly Trp Glu Phe Pro Glu val Leu Gin 
325 330 335 

Phe Gly Arg Leu Arg Leu Glu Gly Phe Met Met Ser Lys Ser Lys lie 
340 345 350 

Arg Gly Met Leu Glu Lys Gly Thr Asn Arg Asp Asp Pro Arg Leu Pro 
355 360 365 

Thr Leu Ala Gly Leu Arg Arg Arg Gly lie Leu Pro Asp Thr lie Lys 
370 375 380 

Asp val lie lie Asp val Gly val Lys val Thr Asp Ala Thr lie Ser 
385 390 395 400 

Phe Glu Asn lie Ala Ala lie Asn Arg Lys Lys Leu Asp Pro val Ala 
405 410 415 

Lys Arg lie Met Phe Val Lys Asp Ala Glu Glu Phe Ser val Glu Leu 
420 425 430 
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Pro Glu ser Leu Asn Ala Lys lie Pro Leu lie Pro ser Lys Gin Glu 
435 440 445 

Met Asn Arg Thr lie lie val Asn Pro Gly Asp Lys lie Leu lie Glu 
450 455 460 

ser Asn Asp Ala Glu Asp Asn ser lie Leu Arg Leu Met Glu Leu Cys 
465 470 475 480 

Asn val Lys val Asp Lys His Asn Arg Lys Leu lie Phe His ser Lys 
485 490 . 495 

Thr Leu Asp Glu Ala Lys Lys val Asn Ala Lys lie val Gin Trp val 
500 505 510 

Lys ser Asn Glu Lys val Pro val Met Val Glu Lys Ala Glu Arg Asp 
515 520 525 

Glu lie Lys Met lie Asn Gly Tyr Ala Glu Lys lie Ala Ala Asp Leu 
530 535 540 

Glu lie Asp Glu lie val Gin Phe Tyr Arg Phe Gly Phe val Arg val 
545 550 555 560 

Asp Arg Lys Asp Glu Asn Met Leu Arg val val Phe Ser His Asp 
565 570 575 

<210> 80 
<211> 4813 
<212> DNA 
<213> artificial 

<220> 

<223> PACKO-A184TAG 
<220> 

<221> mi sc_f eature 

<222> (3749). .(3749) 

<223> n is a, c, g, or t 

<220> 

<221> mi sc_f eature 

<222> (3769) . . (3769) 

<223> n is a, c, g, or t 

<400> 80 

gaactccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60 

gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120 

ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180 

tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240 

aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300 

ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360 
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ggtatcaaca 


gggacaccag 


gatttattta 


ttctgcgaag 


tgatcttccg 


tcacaggtat 


420 


ttattcggcg 


caaagtgcgt 


cgggtgatgc 


tgccaactta 


ctgatttagt 


gtatgatggt 


480 


gtttttgagg 


tgctccagtg 


gcttctgttt 


ctatcagctg 


tccctcctgt 


tcagctactg 


540 


acggggtggt 


gcgtaacggc 


aaaagcaccg 


ccggacatca 


gcgctagcgg 


agtgtatact 


600 


ggcttactat 


gttggcactg 


atgagggtgt 


cagtgaagtg 


cttcatgtgg 


caggagaaaa 


660 


aaggctgcac 


cggtgcgtca 


gcagaatatg 


tgatacagga 


tatattccgc 


ttcctcgctc 


720 


actgactcgc 


tacgctcggt 


cgttcgactg 


cggcgagcgg 


aaatggctta 


cgaacggggc 


780 


ggagatttcc 


tggaagatgc 


caggaagata 


cttaacaggg 


aagtgagagg 


gccgcggcaa 


840 


agccgttttt 


ccataggctc 


cgcccccctg 


acaagcatca 


cgaaatctga 


cgctcaaatc 


900 


agtggtggcg 


aaacccgaca 


ggactataaa 


gataccaggc 


gtttccccct 


ggcggctccc 


960 


tcgtgcgctc 


tcctgttcct 


gcctttcggt 


ttaccggtgt 


cattccgctg 


ttatggccgc 


1020 


gtttgtctca 


ttccacgcct 


gacactcagt 


tccgggtagg 


cagttcgctc 


caagctggac 


1080 


tgtatgcacg 


aaccccccgt 


tcagtccgac 


cgctgcgcct 


tatccggtaa 


ctatcgtctt 


1140 


gagtccaacc 


cggaaagaca 


tgcaaaagca 


ccactggcag 


cagccactgg 


taattgattt 


1200 


agaggagtta 


gtcttgaagt 


catgcgccgg 


ttaaggctaa 


actgaaagga 


caagttttgg 


1260 


tgactgcgct 


cctccaagcc 


agttacctcg 


gttcaaagag 


ttggtagctc 


agagaacctt 


1320 


cgaaaaaccg 


ccctgcaagg 


cggttttttc 


gttttcagag 


caagagatta 


cgcgcagacc 


1380 


aaaacgatct 


caagaagatc 


atcttattaa 


tcagataaaa 


tatttctaga 


tttcagtgca 


1440 


atttatctct 


tcaaatgtag 


cacctgaagt 


cagccccata 


cgatataagt 


tgtaattctc 


1500 


atgtttgaca 


gcttatcatc 


gataagcttt 


aatgcggtag 


tttatcacag 


ttaaattgct 


1560 


aacgcagtca 


ggcaccgtgt 


atgaaatcta 


acaatgcgct 


catcgtcatc 


ctcggcaccg 


1620 


tcaccctgga 


tgctgtaggc 


ataggcttgg 


ttatgccggt 


actgccgggc 


ctcttgcggg 


1680 


atatcggttt 


cttagacgtc 


aggtggcact 


tttcggggaa 


atgtgcgcgg 


aacccctatt 


1740 


tgtttatttt 


tctaaataca 


ttcaaatatg 


tatccgctca 


tgagacaata 


accctgataa 


1800 


atgcttcaat 


aatattgaaa 


aaggaagagt 


atgagtattc 


aacatttccg 


tgtcgccctt 


1860 


attccctttt 


ttgcggcatt 


ttgccttcct 


gtttttgctc 


acccagaaac 


gctggtgaaa 


1920 


gtaaaagatg 


ctgaagatca 


gttgggtgca 


cgagtgggtt 


acatcgaact 


ggatctcaac 


1980 


agcgqtaaga 


tccttgagag 


ttttcgcccc 


gaagaacgtt 


ttccaatgat 


gagcactttt 


2040 


aaagttctgc 


tatgtggcgc 


ggtattatcc 


cgtgttgacg 


ccgggcaaga 


gcaactcggt 


2100 


cgccgcatac 


actattctca 


gaatgacttg 


gttgagtact 


caccagtcac 


agaaaagcat 


2160 


cttacggatg 


gcatgacagt 


aagagaatta 


tgcagtgctg 


ccataaccat 


gagtgataac 


2220 


actgcggcca 


acttacttct 


gacaacgatc 


ggaggaccga 


aggagctaac 


cgcttttttg 


2280 


cacaacatgg 


gggatcatgt 


aactcgcctt 


gatcgttggg 


aaccggagct 


gaatgaagcc 


2340 


ataccaaacg 


acgagcgtga 


caccacgatg 


ccttaggcaa tggcaacaac 
Page 32 


gttgcgcaaa 


2400 



54-000510US ■ ST2 5 . txt 



ctattaactg 


gcgaactact 


tactctagct 


tcccggcaac 


aattaataga 


ctggatggag 


2460 


gcggataaag 


ttgcaggacc 


acttctgcgc 


tcggcccttc 


cggctggctg 


gtttattgct 


2520 


gataaatctg 


gagccggtga 


gcgtgggtct 


cgcggtatca 


ttgcagcact 


ggggccagat 


2580 


ggtaagccct 


cccgtatcgt 


agttatctac 


acgacgggga 


gtcaggcaac 


tatggatgaa 


2640 


cgaaatagac 


agatcgctga 


gataggtgcc 


tcactgatta 


agcattggca 


ccaccaccac 


2700 


caccactaac 


ccgggaccaa 


gtttactcat 


atatacttta 


gattgattta 


aaacttcatt 


2760 


tttaatttaa 


aaggatctag 


gtgaagatcc 


tttttgataa 


tctcatgacc 


aaaatccctt 


2820 


aacggcatgc 


accattcctt 


gcggcggcgg 


tgctcaacgg 


cctcaaccta 


ctactgggct 


2880 


gcttcctaat 


gcaggagtcg 


cataagggag 


agcgtctggc 


gaaaggggga 


tgtgctgcaa 


2940 


ggcgattaag 


ttgggtaacg 


ccagggtttt 


cccagtcacg 


acgttgtaaa 


acgacggcca 


3000 


gtgccaagct 


taaaaaaaat 


ccttagcttt 


cgctaaggat 


ctgcagttat 


aatctctttc 


3060 


taattggctc 


taaaatcttt 


ataagttctt 


cagctacagc 


attttttaaa 


tccattggat 


3120 


gcaattcctt 


atttttaaat 


aaactctcta 


actcctcata 


gctattaact 


gtcaaatctc 


3180 


caccaaattt 


ttctggcctt 


tttatggtta 


aaggatattc 


aaggaagtat 


ttagctatct 


3240 


ccattattgg 


atttccttca 


acaactccag 


ctgggcagta 


tgctttcttt 


atcttagccc 


3300 


taatctcttc 


tggagagtca 


tcaacagcta 


taaaattccc 


ttttgaagaa 


ctcatctttc 


3360 


cttctccatc 


caaacccgtt 


aagacagggt 


tgtgaataca 


aacaaccttt 


tttggtaaaa 


3420 


gctcccttgc 


taacatgtgt 


atttttctct 


gctccatccc 


tccaactgca 


acatcaacgc 


3480 


ctaaataatg 


aatatcatta 


acctgcatta 


ttggatagat 


aacttcagca 


acctttggat 


3540 


tttcatcctc 


tcttgctata 


agttccatac 


tccttcttgc 


tctttttaag 


gtagttttta 


3600 


aagccaatct 


atagacattc 


agtgtataat 


ccttatcaag 


ctggaattca 


gcgttacaag 


3660 


tattacacaa 


agttttttat 


gttgagaata 


tttttttgat 


ggggcgccac 


ttatttttga 


3720 


tcgttcgctc 


aaagaagcgg 


cgccagggnt 


gtttttcttt 


tcaccagtna 


gacgggcaac 


3780 


agaacgccat 


gagcggcctc 


atttcttatt 


ctgagttaca 


acagtccgca 


ccgctgtccg 


3840 


gtagctcctt 


ccggtgggcg 


cggggcatga 


ctatcgtcgc 


cgcacttatg 


actgtcttct 


3900 


ttatcatgca 


actcgtagga 


caggtgccgg 


cagcgcccaa 


cagtcccccg 


gccacggggc 


3960 


ctgccaccat 


acccacgccg 


aaacaagcgc 


cctgcaccat 


tatgttccgg 


atctgcatcg 


4020 


caggatgctg 


ctggctaccc 


tgtggaacac 


ctacatctgt 


attaacgaag 


cgctaaccgt 


4080 


ttttatcagg 


ctctgggagg 


cagaataaat 


gatcatatcg 


tcaattatta 


cctccacggg 


4140 


gagagcctga 


gcaaactggc 


ctcaggcatt 


tgagaagcac 


acggtcacac 


tgcttccggt 


4200 


agtcaataaa 


ccggtaaacc 


agcaatagac 


ataagcggct 


atttaacgac 


cctgccctga 


4260 


accgacgacc 


gggtcgaatt 


tgctttcgaa 


tttctgccat 


tcatccgctt 


attatcactt 


4320 


attcaggcgt 


agcaccaggc 


gtttaagggc 


accaataact 


gccttaaaaa 


aattacgccc 


4380 


cgccctgcca 


ctcatcgcag 


tactgttgta 


attcattaag cattctgccg 
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acatggaagc 


4440 
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catcacagac 


ggcatgatga 


acctgaatcg 


ccagcggcat 


cagcaccttg 


tcgccttgcg 


4500 


tataatattt 


gcccatggtg 


aaaacggggg 


cgaagaagtt 


gtccatattg 


gccacgttta 


4560 


aa LCadadC L 


^ /-»•♦-/-« -3 -3 -a ^ -4- /~ 

ggxgaaac LC 


0 f~ f~ f~ ry r\ ^ "t~ 

acccaygy cL L 


Lgg^^g^y^^ 




L L^LL.ddLdd 




accctttagg 


gaaataggcc 


aggttttcac 


cgtaacacgc 


cacatcttgc 


gaatatatgt 


4680 


gtagaaactg 


ccggaaatcg 


tcgtggtatt 


cactccagag 


cgatgaaaac 


gtttcagttt 


4740 


gctcatggaa 


aacggtgtaa 


caagggtgaa 


cactatccca 


tatcaccagc 


tcaccgtctt 


4800 


tcattgccat 


acg 










4813 



<210> 81 

<211> 4338 

<212> DNA 

<213> artificial 
<220> 

<223> pACKO-Bla 
<220> 

<221> mi sc_feature 

<222> (3274) . .(3274) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (3294) . . (3294) 

<223> n is a, c, g, or t 

<400> 81 



gaactccgga 


tgagcattca 


tcaggcgggc 


aagaatgtga 


ataaaggccg 


gataaaactt 


60 


gtgcttattt 


ttctttacgg 


tctttaaaaa 


ggccgtaata 


tccagctgaa 


cggtctggtt 


120 


ataggtacat 


tgagcaactg 


actgaaatgc 


ctcaaaatgt 


tctttacgat 


gccattggga 


180 


tatatcaacg 


gtggtatatc 


cagtgatttt 


tttctccatt 


ttagcttcct 


tagctcctga 


240 


aaatctcgat 


aactcaaaaa 


atacgcccgg 


tagtgatctt 


atttcattat 


ggtgaaagtt 


300 


ggaacctctt 


acgtgccgat 


caacgtctca ttttcgccaa 


aagttggccc 


agggcttccc 


360 


ggtatcaaca 


gggacaccag 


gatttattta ttctgcgaag 


tgatcttccg 


tcacaggtat 


420 


ttattcggcg 


caaagtgcgt 


cgggtgatgc 


tgccaactta 


ctgatttagt 


gtatgatggt 


480 


gtttttgagg 


tgctccagtg 


gcttctgttt 


ctatcagctg 


tccctcctgt 


tcagctactg 


540 


acggggtggt 


gcgtaacggc 


aaaagcaccg 


ccggacatca 


gcgctagcgg 


agtgtatact 


600 


ggcttactat 


gttggcactg 


atgagggtgt 


cagtgaagtg 


cttcatgtgg 


caggagaaaa 


660 


aaggctgcac 


cggtgcgtca 


gcagaatatg 


tgatacagga 


tatattccgc 


ttcctcgctc 


720 


actgactcgc 


tacgctcggt 


cgttcgactg 


cggcgagcgg 


aaatggctta 


cgaacggggc 


780 


ggagatttcc 


tggaagatgc 


caggaagata 


cttaacaggg 


aagtgagagg 


gccgcggcaa 


840 


agccgttttt 


ccataggctc 


cgcccccctg 


acaagcatca 


cgaaatctga 


cgctcaaatc 


900 


agtggtggcg 


aaacccgaca 


ggactataaa gataccaggc 


gtttccccct 


ggcggctccc 


960 
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tcgtgcgctc 


tcctgttcct 


gcctttcggt 


ttaccggtgt 


cattccgctg 


ttatggccgc 


1020 


gtttgtctca 


ttccacgcct 


gacactcagt 


tccgggtagg 


cagttcgctc 


caagctggac 


1080 


tgtatgcacg 


aaccccccgt 


tcagtccgac 


cgctgcgcct 


tatccggtaa 


ctatcgtctt 


1140 


gagtccaacc 


cggaaagaca 


tgcaaaagca 


ccactggcag 


cagccactgg 


taattgattt 


1200 


agaggagtta 


gtcttgaagt 


catgcgccgg 


ttaaggctaa 


actgaaagga 


caagttttgg 


1260 


tgactgcgct 


cctccaagcc 


agttacctcg 


gttcaaagag 


ttggtagctc 


agagaacctt 


1320 


cgaaaaaccg 


ccctgcaagg 


cggttttttc 


gttttcagag 


caagagatta 


cgcgcagacc 


1380 


aaaacgatct 


caagaagatc 


atcttattaa 


tcagataaaa 


tatttctaga 


tttcagtgca 


1440 


atttatctct 


tcaaatgtag 


cacctgaagt 


cagccccata 


cgatataagt 


tgtaattctc 


1500 


atgtttgaca 


gcttatcatc 


gataagcttt 


aatgcggtag 


tttatcacag 


ttaaattgct 


1560 


aacgcagtca 


ggcaccgtgt 


atgaaatcta 


acaatgcgct 


catcgtcatc 


ctcggcaccg 


1620 


tcaccctgga 


tgctgtaggc 


ataggcttgg 


ttatgccggt 


actgccgggc 


ctcttgcggg 


1680 


atatcggttt 


cttagacgtc 


aggtggcact 


tttcggggaa 


atgtgcgcgg 


aacccctatt 


1740 


tgtttatttt 


tctaaataca 


ttcaaatatg 


tatccgctca 


tgagacaata 


accctgataa 


1800 


atgcttcaat 


aatattgaaa 


aaggaagagt 


atgagtattc 


aacatttccg 


tgtcgccctt 


1860 


attccctttt 


ttgcggcatt 


ttgccttcct 


gtttttgctc 


acccagaaac 


actagtgcag 


1920 


caatggcaac 


aacgttgcgc 


aaactattaa 


ctggcgaact 


acttactcta 


gcttcccggc 


1980 


aacaattaat 


agactggatg 


gaggcggata 


aagttgcagg 


accacttctg 


cgctcggccc 


2040 


ttccggctgg 


ctggtttatt 


gctgataaat 


ctggagccgg 


tgagcgtggg 


tctcgcggta 


2100 


tcattgcagc 


actggggcca 


gatggtaagc 


cctcccgtat 


cgtagttatc 


tacacgacgg 


2160 


ggagtcaggc 


aactatggat 


gaacgaaata 


gacagatcgc 


tgagataggt 


gcctcactga 


2220 


ttaagcattg 


gtaacccggg 


accaagttta 


ctcatatata 


ctttagattg 


atttaaaact 


2280 


tcatttttaa 


tttaaaagga 


tctaggtgaa 


gatccttttt 


gataatctca 


tgaccaaaat 


2340 


cccttaacgg 


catgcaccat 


tccttgcggc 


ggcggtgctc 


aacggcctca 


acctactact 


2400 


gggctgcttc 


ctaatgcagg 


agtcgcataa 


gggagagcgt 


ctggcgaaag 


ggggatgtgc 


2460 


tgcaaggcga 


ttaagttggg 


taacgccagg 


gttttcccag 


tcacgacgtt 


gtaaaacgac 


2520 


ggccagtgcc 


aagcttaaaa 


aaaatcctta 


gctttcgcta 


aggatctgca 


gttataatct 


2580 


ctttctaatt 


ggctctaaaa 


tctttataag 


ttcttcagct 


acagcatttt 


ttaaatccat 


2640 


tggatgcaat 


tccttatttt 


taaataaact 


ctctaactcc 


tcatagctat 


taactgtcaa 


2700 


atctccacca 


aatttttctg 


gcctttttat 


ggttaaagga 


tattcaagga 


agtatttagc 


2760 


tatctccatt 


attggatttc 


cttcaacaac 


tccagctggg 


cagtatgctt 


tctttatctt 


2820 


agccctaatc 


tcttctggag 


agtcatcaac 


agctataaaa 


ttcccttttg 


aagaactcat 


2880 


ctttccttct 


ccatccaaac 


ccgttaagac 


agggttgtga 


atacaaacaa 


ccttttttgg 


2940 


taaaagctcc 


cttgctaaca 


tgtgtatttt 


tctctgctcc 


atccctccaa 


ctgcaacatc 


3000 
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taatgaatat 
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cattattgga tagataactt 


cagcaacctt 


3060 


tggattttca 


tcctctcttg 


ctataagttc 


catactcctt 


cttgctcttt 


ttaaggtagt 


3120 


ttttaaagcc 


aatctataga 


cattcagtgt 


ataatcctta 


tcaagctgga 


attcagcgtt 


3180 


acaagtatta 


cacaaagttt 


tttatgttga 


gaatattttt 


ttgatggggc 


gccacttatt 


3240 


tttgatcgtt 


cgctcaaaga 


aqcqqcqcca 


gggntgtttt 


tcttttcacc 


agtnagacgg 


3300 


gcaacagaac 


accataaQcq 


gcctcatttc 


ttattctgag 


ttacaacagt 


ccgcaccgct 


3360 


Qtccaataac 


tccttccggt 


qqqcqcqqqq 

3J 3J 33 j7 ^ 3 3J 57 


catgactatc 


gtcgccgcac 


ttatgactgt 


3420 


cttctttatc 


atgcaactcg 


taggacaggt 


gccggcagcg 


cccaacagtc 


ccccggccac 


3480 


aaaacctacc 


accataccca 


cgccgaaaca 


agcgccctgc 


accattatgt 


tccggatctg 


3540 


catcgcagga 


tactactqqc 


taccctgtgg 


aacacctaca 


tctgtattaa 


cgaagcgcta 


3600 


accgt'tttta 


tcaggctctg 


ggaggcagaa 


taaatgatca 


tatcgtcaat 


tattacctcc 


3660 




cctgagcaaa 


ctggcctcag 


gcatttgaga 


agcacacggt 


cacactgctt 


3720 


ccaataa'tca 


ataaaccggt 


aaaccagcaa 


tagacataag 


cggctattta 


acgaccctgc 


3780 


cctgaaccga 


cgaccgggtc 


gaatttgctt 


tcgaatttct 


gccattcatc 


cgcttattat 


3840 


cact'ta'ttca 


ggcgtagcac 


caggcgttta 


agggcaccaa 


taactgcctt 


aaaaaaatta 


3900 


coccccaccc 

\^ V# V# V* %0 


tgccactcat 


cgcagtactg 


ttgtaattca 


ttaagcattc 


tgccgacatg 


3960 


aaaaccatca 


caaacQQcat 


aataaaccta 


aatcgccagc 


ggcatcagca 


ccttgtcgcc 


4020 


t "to cat ataa 


tatttaccca 


taataaaaac 


gggggcgaag 


aagttgtcca 


tattggccac 


4080 


a "t "t "t aaat ca 


aaactaataa 


aactcaccca 


yyy" *- *-yy^ ^ 


y ay civ.y CLCLCLci 


acatattctc 


4140 


aataaaccct 


ttagggaaat 


aggccaggtt 


ttcaccgtaa 


cacgccacat 


cttgcgaata 


4200 


tatgtgtaga 


aactgccgga 


aatcgtcgtg 


gtattcactc 


cagagcgatg 


aaaacgtttc 


4260 


agtttgctca 


tggaaaacgg 


tgtaacaagg 


gtgaacacta 


tcccatatca 


ccagctcacc 


4320 


gtctttcatt 


gccatacg 










4338 


<210> 82 
<211> 2271 
<212> DNA 
<213> artificial 












<220> 

<223> pKQ 














<400> 82 
atggatccga 


gctcgagatc 


tqcaqctqqt 


accatatggg 


aattcgaagc 


ttgggcccga 


60 


acaaaaactc 


atctcagaag 


aggatctgaa 


tagcgccgtc 


gaccatcatc 


atcatcatca 


120 


ttgagtttaa 


acggtctcca 


gcttggctgt 


tttggcggat 


gagagaagat 


L L Lcay Lcxy 


1 so 


atacagatta 


aatcagaacg 


cagaagcggt 


ctgataaaac 


agaatttgcc 


tggcggcagt 


240 


agcgcggtgg 


tcccacctga 


ccccatgccg 


aactcagaag 


tgaaacgccg 


tagcgccgat 


300 


ggtagtgtgg 


ggtctcccca 


tgcgagagta 


gggaactgcc 


aggcatcaaa 


taaaacgaaa 


360 
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ggctcagtcg 


aaagactggg 


cctttcgttt 


tatctgttgt 


ttgtcggtga 


acgatatctg 


420 


cttttcttcg 


cgaattaatt 


ccgcttcgca 


acatgtgagc 


aaaaggccag 


caaaaggcca 


480 


ggaaccgtaa 


aaaggccgcg 


ttgctggcgt 


ttttccatag 


gctccgcccc 


cctgacgagc 


540 


atcacaaaaa 


tcgacgctca 


agtcagaggt 


ggcgaaaccc 


gacaggacta 


taaagatacc 


600 


aggcgtttcc 


ccctggaagc 


tccctcgtgc 


gctctcctgt 


tccgaccctg 


ccgcttaccg 


660 


gatacctgtc 


cgcctttctc 


ccttcgggaa 


gcgtggcgct 


ttctcatagc 


tcacgctgta 


720 


ggtatctcag 


ttcggtgtag 


gtcgttcgct 


ccaagctggg 


ctgtgtgcac 


gaaccccccg 


780 


ttcagcccga 


ccgctgcgcc 


ttatccggta 


actatcgtct 


tgagtccaac 


ccggtaagac 


840 


acgacttatc 


gccactggca 


gcagccactg 


gtaacaggat 


tagcagagcg 


aggtatgtag 


900 


gcggtgctac 


agagttcttg 


aagtggtggc 


ctaactacgg 


ctacactaga 


aggacagtat 


960 


ttggtatctg 


cgctctgctg 


aagccagtta 


ccttcggaaa 


aagagttggt 


agctcttgat 


1020 


ccggcaaaca 


aaccaccgct 


ggtagcggtg 


gtttttttgt 


ttgcaagcag 


cagattacgc 


1080 


gcagaaaaaa 


aggatctcaa 


gaagatcctt 


tgatcttttc 


tacggggtct 


gacgctcagt 


1140 


ggaacgaaaa 


ctcacgttaa 


gggattttgg 


tcatgagttg 


tgtctcaaaa 


tctctgatgt 


1200 


tacattgcac 


aagataaaaa 


tatatcatca 


tgaacaataa 


aactgtctgc 


ttacataaac 


1260 


agtaatacaa 


ggggtgttat 


gagccatatt 


caacgggaaa 


cgtcttgctc 


gaggccgcga 


1320 


ttaaattcca 


acatggatgc 


tgatttatat 


gggtataaat 


gggctcgcga 


taatgtcggg 


1380 


caatcaggtg 


cgacaatcta 


tcgattgtat 


gggaagcccg 


atgcgccaga 


gttgtttctg 


1440 


aaacatggca 


aaggtagcgt 


tgccaatgat 


gttacagatg 


agatggtcag 


actaaactgg 


1500 


ctgacggaat 


ttatgcctct 


tccgaccatc 


aagcatttta 


tccgtactcc 


tgatgatgca 


1560 


tggttactca 


ccactgcgat 


ccccgggaaa 


acagcattcc 


aggtattaga 


agaatatcct 


1620 


gattcaggtg 


aaaatattgt 


tgatgcgctg 


gcagtgttcc 


tgcgccggtt 


gcattcgatt 


1680 


cctgtttgta 


attgtccttt 


taacagcgat 


cgcgtatttc 


gtctcgctca 


ggcgcaatca 


1740 


cgaatgaata 


acggtttggt 


tgatgcgagt 


gattttgatg 


acgagcgtaa 


tggctggcct 


1800 


gttgaacaag 


tctggaaaga 


aatgcataag 


cttttgccat 


tctcaccgga 


ttcagtcgtc 


1860 


actcatggtg 


atttctcact 


tgataacctt 


atttttgacg 


aggggaaatt 


aataggttgt 


1920 


attgatgttg 


gacgagtcgg 


aatcgcagac 


cgataccagg 


atcttgccat 


cctatggaac 


1980 


tgcctcggtg 


agttttctcc 


ttcattacag 


aaacggcttt 


ttcaaaaata 


tggtattgat 


2040 


aatcctgata 


tgaataaatt 


gcagtttcat 


ttgatgctcg 


atgagttttt 


ctaatcagaa 


2100 


ttggttaatt 


ggttgtaaca 


ctggcagagc 


attacgctga 


cttgacggga 


cggcggcttt 


2160 


gttgaataaa 


tcgaactttt 


gctgagttga 


aggatcctcg 


ggagttgtca 


gcctgtcccg 


2220 


cttataagat 


catacgccgt 


tatacgttgt 


ttacgctttg 


aggaattaac 


c 


2271 



<210> 83 
<211> 38 
<212> DMA 
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<213> artificial 
<220> 

<223> oligonucleotide primer 
<400> 83 

ggagagaaga ctacatgaaa gaagtcataa tgaaatac 38 



<210> 84 

<211> 38 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 84 

cgttagaaga ctgaattcgg attttggtca cgggtgcg 38 



<210> 85 

<211> 36 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 85 

gctatcacac catggcgatg ctcctagggg accatc 36 



<210> 86 

<211> 40 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 86 

gatcagtcag aattcttact tgtgtgtgta tatcaccctg 40 



<210> 87 

<211> 33 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 87 

ggagacgtct cacatggaag agaagatatt gcc 



33 



<210> 88 

<211> 36 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 88 

cgttacgtct cgaattttat ctatgtgcat agcaac 36 
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<210> 89 
<211> 33 
<212> DNA 
<213> artificial 

<220> 

<223> oligonucleotide primer 
<400> 89 

ggagacgtct cacatgacct taagtcctga aga 



<210> 90 

<211> 43 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 90 

cgttacgtct cgaattttat ttatgagcaa agtacgccac gac 



<210> 91 

<211> 33 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 91 

ggagacgtct ctcatggtgc cagtggagga cct 



<210> 92 

<211> 38 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 92 

cgttacgtct ctaattttat ttgtgggcat agtagaag 



<210> 93 

<211> 33 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 93 

ggaaaggtct ctcatggatg tggaaaagat age 



<210> 94 

<211> 36 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 
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<400> 94 

gagtaggtct ctaattttac ttgtgagcat atattg 36 

<210> 95 

<211> 36 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 
<400> 95 

gctactgcac catggaatta aatgaattac gagaac 36 

<210> 96 

<211> 38 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 96 

gactcagtag aattcattag tcgtgtgaga atacaaca 38 

<210> 97 

<211> 48 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 
<400> 97 

cggaattcgc cccggtggtg tagcccggcc aagcatgcgg gcctctaa 48 

<210> 98 

<211> 67 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 
<400> 98 

aaactgcagt ggtgccccgg ccgggatttg aacccgggtc gcgggcttta gaggcccgca 60 
tgcttgg 67 

<210> 99 

<211> 58 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 
<400> 99 

cggaattcag ccccgtggtg tagcggccaa gcatgcgggg ctctaaaccc cgcgaccg 58 



<210> 100 
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<211> 
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<220> 
<223> 
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55 
DNA 

artificial 



oligonucleotide primer 



<400> 100 

aaactgcagt ggtagccccg cggggattcg aaccccggtc gcggggttta gagcc 



55 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 



101 

58 

DNA 

artificial 



oligonucleotide primer 



<400> 101 

cggaattcag ccccgtggtg tagcggccaa gcacgcgggg ctctaaaccc cgcgaccg 



58 



<210> 102 

<211> 55 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 102 

aaactgcagt ggtagccccg cggggattcg aaccccggtc gcggggttta gagcc 



55 



<210> 103 

<211> 48 

<212> DNA 

<213> arti f i ci al 

<220> 

<223> oligonucleotide primer 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



mi sc_feature 
(18) . . (18) 
n is a, c, g, 



or t 



misc_feature 

(36).. (36) 

n is a, c, g, or t 



<400> 103 

cggaattcgc cccggtgntg tagcccggcc aagcangcgg gcctctaa 



48 



<210> 104 

<211> 18 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 104 
ggacagcggt gcggactg 



18 



Page 41 



54-000510US . ST2 5 . txt 



<210> 105 

<211> 18 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 105 

gtgctcaacg gcctcaac 18 

<210> 106 

<211> 38 

<212> DNA 

<213> artificial 

<220> 

<223> oligonucleotide primer 

<400> 106 

aggcttttaa ccatggcaaa aatcaaaact cgcttcgc 38 
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